Abstract

We offer a new proof of Furstenberg and Katznelson's density version of
the Hales-Jewett Theorem:

Theorem. For any $\delta > 0$ there is some $N_0 \geq 1$ such that whenever
$A \subseteq [k]^N$ with $N \geq N_0$ and $|A| \geq \delta k^N$, $A$ contains a
combinatorial line: that is, for some $I \subseteq [N]$ nonempty and
$w_0 \in [k]^{[N] \setminus I}$ we have

\[ A \supseteq \{w : w|_{[N] \setminus I} = w_0,\ w|_I = \mathrm{const.}\}. \]

Following Furstenberg and Katznelson, we first show that this result is
equivalent to a 'multiple recurrence' assertion for a class of probability
measures enjoying a certain kind of stationarity. However, we then give a quite
different proof of this latter assertion through a reduction to an infinitary
removal lemma in the spirit of Tao [24] (and also its recent re-interpretation
in [3]). This reduction is based on a structural analysis of these stationary
laws closely analogous to the classical representation theorems for various
partially exchangeable stochastic processes in the sense of Hoover [17],
Aldous [1, 2] and Kallenberg [18]. However, the underlying combinatorial
arguments used to prove this theorem are rather different from those required
to work with exchangeable arrays, and involve crucially an observation that
arose during ongoing work by a collaborative team of authors [21] to give a
purely finitary proof of the above theorem.

1 Introduction

In this note we record a new proof of the following result of Furstenberg and
Katznelson:

Theorem 1.1 (Density Hales-Jewett Theorem). For any $\delta > 0$ there is some
$N_0 \geq 1$ such that whenever $A \subseteq [k]^N$ with $N \geq N_0$ and
$|A| \geq \delta k^N$, $A$ contains a combinatorial line: that is, for some
$I \subseteq [N]$ nonempty and $w_0 \in [k]^{[N] \setminus I}$ we have

\[ A \supseteq \{w : w|_{[N] \setminus I} = w_0,\ w|_I = \mathrm{const.}\}. \]

This is the 'density' version of the classical Hales-Jewett Theorem of colouring
Ramsey Theory. The Hales-Jewett Theorem is one of the central results of Ramsey
Theory, partly because many other results in that area can be deduced from it (see,
for example, Chapter 2 of Graham, Rothschild and Spencer [16]). Likewise, the
above density variant generalizes many other results in density Ramsey Theory,
such as the famous theorem of Szemerédi ([22]) and its multidimensional analog,
also proved by Furstenberg and Katznelson ([11]).

Following Furstenberg's discovery in [9] of an alternative proof of Szemerédi's
Theorem via a conversion to a result in ergodic theory, the use of ergodic-theoretic
methods to prove results in density Ramsey Theory has become widespread and
powerful (see, for example, the survey of Bergelson [7]), and the above result was
one of the furthest-reaching consequences of this program. In this paper we follow
Furstenberg and Katznelson as far as their re-interpretation of the above result in
terms of stochastic processes, but then we offer a new proof of that version of the
result.

In order to state the result about stochastic processes into which Furstenberg and
Katznelson convert Theorem 1.1, let us first define $[k]^\omega$ to be the
infinite-dimensional combinatorial space over the alphabet $[k]$:

\[ [k]^\omega := \bigcup_{n \geq 1} [k]^n. \]
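As a small sanity check on the finite statement of Theorem 1.1, the following Python sketch enumerates all combinatorial lines in $[k]^N$ for tiny $k$ and $N$ and finds by brute force the largest subset containing no line. It is only an illustration and plays no role in the argument; the function names `combinatorial_lines` and `max_line_free` are hypothetical, and the search is exponential, so it is feasible only for very small parameters.

```python
from itertools import combinations, product


def combinatorial_lines(k, n):
    """Return every combinatorial line in [k]^n as a tuple of k words.

    A line is described by a template in ([k] ∪ {"*"})^n containing at
    least one wildcard "*"; its s-th point substitutes s for every wildcard.
    """
    alphabet = list(range(1, k + 1))
    lines = []
    for template in product(alphabet + ["*"], repeat=n):
        if "*" not in template:
            continue  # no wildcard position: this is a single word, not a line
        lines.append(tuple(
            tuple(s if t == "*" else t for t in template)
            for s in alphabet
        ))
    return lines


def max_line_free(k, n):
    """Size of the largest subset of [k]^n containing no combinatorial line."""
    words = list(product(range(1, k + 1), repeat=n))
    lines = [frozenset(line) for line in combinatorial_lines(k, n)]
    for size in range(len(words), -1, -1):
        for subset in combinations(words, size):
            chosen = set(subset)
            if not any(line <= chosen for line in lines):
                return size
    return 0


if __name__ == "__main__":
    print(len(combinatorial_lines(3, 2)), max_line_free(3, 2))
```

For $k = 2$ a combinatorial line is a comparable pair in the Boolean lattice, so line-free sets are exactly antichains and the search recovers Sperner's bound for small $N$.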

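Theorem 1.2 (Infinitary Density Hales-Jewett Theorem). For any $\delta > 0$, if $\mu$ is a
Borel probability measure on $\{0,1\}^{[k]^\omega}$ for which

\[ \mu\{x \in \{0,1\}^{[k]^\omega} : x_w = 1\} \geq \delta \quad \forall w \in [k]^\omega \]

then there are some $N \geq 1$, $I \subseteq [N]$ nonempty and $w_0 \in [k]^{[N] \setminus I}$ such that

\[ \mu\{x \in \{0,1\}^{[k]^\omega} : x_w = 1\ \ \forall w \in [k]^N \text{ s.t. } w|_{[N] \setminus I} = w_0,\ w|_I = \mathrm{const.}\} > 0. \]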

It is shown in Proposition 2.1 of [13] that Theorems 1.1 and 1.2 are equivalent.
Here we will assume this first step of Furstenberg and Katznelson and concentrate
on proving Theorem 1.2, and will also follow their next step (Lemma 4.2 below) to
reduce our study to a class of 'strongly stationary' probability measures. Unlike the
earlier settings of ergodic Ramsey Theory, this stationarity condition is not
readily described by a collection of invertible probability-preserving transformations
(it is instead described in terms of a very large semigroup of highly
non-invertible transformations, which cannot easily be made invertible by passing
to any simple extended system while respecting the relations of the semigroup).
Consequently Furstenberg and Katznelson must next impose a collection of
invertible transformations 'by hand' that describes only a rather weaker subsemigroup
of symmetries, and then bring modifications of their older ergodic-theoretic
techniques (see, in particular, [10] and [12]) to bear on these.

Here we avoid the introduction of these transformations, and give an analysis
purely in terms of the strong stationarity obtained initially. This has much more in
common with many of the basic studies of partially exchangeable arrays of random
variables, particularly by Kingman, Hoover, Aldous and Kallenberg: see, for
example, the recent book of Kallenberg [19] and the references given there (and also
the survey [6], which treats this subject in a very similar formalism to the present
paper and also describes the relations of those developments to other
combinatorial results in extremal hypergraph theory). Ultimately we reduce the problem to
an application of the 'infinitary hypergraph removal lemma' of Tao [23] (or, more
precisely, of a cut-down corollary of that lemma first used in [3] to give a very
similar new proof of the Multidimensional Szemerédi Theorem).

We will prove our main structural result for strongly stationary laws as an
assertion that any such law is a 'factor' of a law with a particularly simple structure
(similar to the structure of the joint distribution of all the ingredients that are
introduced for the representation of an exchangeable array), and then this structure will
give the reduction to an infinitary removal lemma. This 'simple structure' will be
introduced in the definition of 'sated' laws in Section 5 below.

We note here that bringing this general program to bear on the task of proving
Theorem 1.2 would not have been possible without a crucial insight that recently
emerged from an ongoing open collaborative project of Bukh, Gowers, Kalai,
McCutcheon, O'Donnell, Solymosi and Tao. Their ultimate goal was a purely finitary,
combinatorial proof of Theorem 1.1, and as the present paper neared completion
this also seemed to have been realized; these developments can be followed
online ([21]). The critical observation that we have taken from their work drives our
proof of Theorem 6.2 below, but it has been translated into a very different lexicon
from the finitary work of [21] and we do not attempt to set up a full dictionary here.
I am also grateful to Tim Gowers and Terence Tao for helpful suggestions made
about earlier drafts of this paper.



2 Some background from combinatorics 



We write $[N]$ to denote the discrete interval $\{1, 2, \ldots, N\}$ and $\mathcal{P}S$ to denote the
power set of $S$.

Most of our work will consider probabilities on product spaces indexed by the
infinite-dimensional combinatorial space $[k]^\omega$ introduced above. We will denote
the concatenation of two finite words $u, v \in [k]^\omega$ by either $uv$ or $u \oplus v$. For any
fixed finite $n$ we can define an $n$-dimensional subspace of $[k]^\omega$ to be an injection
$\phi : [k]^n \hookrightarrow [k]^\omega$ specified as follows: for some integers
$0 = N_0 < N_1 < N_2 < \ldots < N_n$, nonempty subsets $I_1 \subseteq [N_1]$,
$I_2 \subseteq [N_2] \setminus [N_1]$, ..., $I_n \subseteq [N_n] \setminus [N_{n-1}]$
and fixed words $w_1 \in [k]^{N_1}$, $w_2 \in [k]^{N_2}$, ..., $w_n \in [k]^{N_n}$ we let
$\phi(v_1 v_2 \cdots v_n)$ be the word in $[k]^\omega$ of length $N_n$ such that when
$N_i < m \leq N_{i+1}$ we have

\[ \phi(v_1 v_2 \cdots v_n)_m := \begin{cases} (w_{i+1})_m & \text{if } m \in \{N_i + 1, N_i + 2, \ldots, N_{i+1}\} \setminus I_{i+1} \\ v_{i+1} & \text{if } m \in I_{i+1}. \end{cases} \]

In these terms a combinatorial line is simply a 1-dimensional combinatorial
subspace.
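The rule above can be made concrete with a short Python sketch. It is an illustration under simplifying assumptions, not the paper's formalism: the function name `make_subspace` is hypothetical, and the fixed words $w_1, \ldots, w_n$ are replaced by a single background word of length $N_n$ that supplies the letter at every non-wildcard position.

```python
from itertools import product


def make_subspace(breaks, wild_sets, background):
    """Build phi : [k]^n -> [k]^{N_n} from subspace data.

    breaks     = [N_0 = 0, N_1, ..., N_n] (strictly increasing),
    wild_sets  = [I_1, ..., I_n] with I_{i+1} a nonempty subset of
                 {N_i + 1, ..., N_{i+1}},
    background = word of length N_n giving the fixed letters (assumption:
                 one word stands in for the fixed words w_1, ..., w_n).
    """
    n = len(wild_sets)
    if len(breaks) != n + 1 or breaks[0] != 0 or len(background) != breaks[-1]:
        raise ValueError("inconsistent subspace data")
    for i in range(n):
        block = set(range(breaks[i] + 1, breaks[i + 1] + 1))
        if not wild_sets[i] or not set(wild_sets[i]) <= block:
            raise ValueError("each wildcard set must be a nonempty subset of its block")

    def phi(v):
        if len(v) != n:
            raise ValueError("expected a word of length n")
        out = list(background)
        for i in range(n):
            for m in wild_sets[i]:
                out[m - 1] = v[i]  # positions are 1-indexed in the text
        return tuple(out)

    return phi


if __name__ == "__main__":
    # A 1-dimensional subspace of [3]^omega, i.e. a combinatorial line.
    line = make_subspace([0, 3], [{1, 3}], (1, 2, 1))
    print([line((a,)) for a in (1, 2, 3)])
    # A 2-dimensional subspace; injectivity can be checked by brute force.
    plane = make_subspace([0, 2, 5], [{2}, {3, 5}], (1, 1, 2, 3, 2))
    print(len({plane(v) for v in product((1, 2, 3), repeat=2)}))
```

Because each wildcard set is nonempty and the sets lie in disjoint blocks, every letter of the input word is visible in the output, which is why the map is injective.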

Similarly, an infinite-dimensional subspace (or often just subspace) of $[k]^\omega$ is an
injection $\phi : [k]^\omega \hookrightarrow [k]^\omega$ specified by the above rule for some infinite sequence
$0 = N_0 < N_1 < N_2 < \ldots$ and nonempty $I_{i+1} \subseteq [N_{i+1}] \setminus [N_i]$. It is clear that the
collection of all subspaces of $[k]^\omega$ forms a semigroup under composition.

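Finally, let us define letter-replacement maps: given $i \in [k]$ and $e \subseteq [k]$, for each
$N \geq 1$ we define $r^N_{e,i} : [k]^N \to [k]^N$ by

\[ r^N_{e,i}(w)_m := \begin{cases} i & \text{if } w_m \in e \\ w_m & \text{if } w_m \in [k] \setminus e \end{cases} \]

for $m \leq N$, and let

\[ r_{e,i} := \bigcup_{N \geq 1} r^N_{e,i} \]

(so clearly $r_{e,i}$ actually takes values in the subset $([k] \setminus (e \setminus \{i\}))^\omega$).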
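A minimal Python sketch of the letter-replacement maps may help fix the definition. The names `replace_letters` and `is_combinatorial_line` are illustrative choices, not notation from the paper. The second function checks the elementary fact used later in the proof of Theorem 6.2: if the letter $j$ occurs in $w$, then the words obtained by replacing $j$ with $s$, for $s = 1, 2, \ldots, k$, form a combinatorial line.

```python
def replace_letters(w, e, i):
    """r_{e,i}: replace every letter of the word w that lies in e by i."""
    return tuple(i if letter in e else letter for letter in w)


def is_combinatorial_line(words):
    """Check whether words[0], ..., words[k-1] form a combinatorial line.

    On a nonempty set of positions the s-th word must carry the letter s,
    and on every other position all k words must agree.
    """
    k = len(words)
    n = len(words[0])
    if any(len(word) != n for word in words):
        return False
    wildcard_positions = 0
    for m in range(n):
        column = [word[m] for word in words]
        if column == list(range(1, k + 1)):
            wildcard_positions += 1
        elif len(set(column)) != 1:
            return False
    return wildcard_positions > 0


if __name__ == "__main__":
    w = (1, 3, 2, 3)
    # Replacing the letter 3 by s = 1, 2, 3 sweeps out a line through w.
    print([replace_letters(w, {3}, s) for s in (1, 2, 3)])
```

Note that `replace_letters(w, e, i)` never outputs a letter of $e \setminus \{i\}$, matching the parenthetical remark above.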



3 Some background from probability 

Throughout this paper $(X, \Sigma)$ will denote a standard Borel measurable space. We
shall write $(X^I, \Sigma^{\otimes I})$ for the usual product measurable structure indexed by a set $I$
and $\mu^{\otimes I}$ for the product of a probability measure $\mu$ on $(X, \Sigma)$. Given a measurable
map $\phi : (X, \Sigma) \to (Y, \Phi)$ to another standard Borel space, we shall write $\phi_\#\mu$ for
the resulting pushforward probability measure on $(Y, \Phi)$. We will generally use $\pi_I$
to denote any coordinate projection from a product space $X^J$ onto its factor $X^I$
for any $I \subseteq J$, and will shorten $\pi_{\{i\}}$ to $\pi_i$.

Most of our interest will be in probability measures on the product spaces
$(X^{[k]^\omega}, \Sigma^{\otimes [k]^\omega})$ for various standard Borel spaces $(X, \Sigma)$. In this paper we will
simply refer to these as laws, in view of their interpretation as the joint laws of
$X$-valued stochastic processes indexed by $[k]^\omega$. Let us note here that Theorem 1.2 is
clearly equivalent to the following superficially more general result, whose
formulation will be more convenient for our proof.

Theorem 3.1. For any $\delta > 0$, if $(X, \Sigma)$ is a standard Borel space, $\mu$ is a law on
$(X^{[k]^\omega}, \Sigma^{\otimes [k]^\omega})$ and $A \in \Sigma$ is such that $\mu(\pi_w^{-1}(A)) \geq \delta$ for every $w \in [k]^\omega$ then
there are some $m \geq 1$ and a combinatorial line $\ell : [k] \hookrightarrow [k]^m$ such that

\[ \mu\Big( \bigcap_{i=1}^{k} \pi_{\ell(i)}^{-1}(A) \Big) > 0. \]

If $\mu$ is a law and $\phi : [k]^\omega \hookrightarrow [k]^\omega$ is a subspace, then the projected law
$(\pi_{\mathrm{image}(\phi)})_\#\mu$ on $X^{\mathrm{image}(\phi)}$ can be canonically identified with another law on
$X^{[k]^\omega}$, simply because $\phi$ itself gives an identification of $[k]^\omega$ with $\mathrm{image}(\phi)$. In this
case we will write $\phi^*\mu$ for this new law on $X^{[k]^\omega}$.

Borrowing some notation from ergodic theory, a factor of a law $\mu$ on $(X^{[k]^\omega}, \Sigma^{\otimes [k]^\omega})$
will be a Borel map $\phi : (X, \Sigma) \to (Y, \Phi)$ to some other standard Borel space
$(Y, \Phi)$. To such a map we can associate its inverse-image $\sigma$-subalgebra $\phi^{-1}(\Phi) \leq \Sigma$,
and it is standard that in the category of Borel spaces, given a Borel probability
measure on $(X, \Sigma)$, any $\sigma$-subalgebra of $\Sigma$ agrees with the inverse-image $\sigma$-subalgebra
of some factor $\phi$ up to modifying by negligible sets (see, for example, Chapter
2 of Glasner [14]). To such a map $\phi$ we associate the map $\phi^{[k]^\omega} : X^{[k]^\omega} \to Y^{[k]^\omega}$
corresponding to the coordinate-wise action of $\phi$, and will refer to $(\phi^{[k]^\omega})_\#\mu$ as
the associated factor law of $\mu$. In the opposite direction, if $\mu$ arises from a factor
of some 'larger' law $\lambda$ via the factor $\phi$ then we will refer to $\lambda$ as an extension of $\mu$
through $\phi$.

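An inverse system of laws comprises an inverse system of standard Borel spaces

\[ \cdots \xrightarrow{\psi^{(m+2)}_{(m+1)}} (X_{(m+1)}, \Sigma_{(m+1)}) \xrightarrow{\psi^{(m+1)}_{(m)}} (X_{(m)}, \Sigma_{(m)}) \xrightarrow{\psi^{(m)}_{(m-1)}} \cdots \xrightarrow{\psi^{(1)}_{(0)}} (X_{(0)}, \Sigma_{(0)}) \]

together with a sequence of laws $\mu_{(m)}$ on $(X_{(m)}^{[k]^\omega}, \Sigma_{(m)}^{\otimes [k]^\omega})$ such that
$((\psi^{(m+1)}_{(m)})^{[k]^\omega})_\#\mu_{(m+1)} = \mu_{(m)}$ for every $m$. In this case we will define

\[ \psi^{(m)}_{(k)} := \psi^{(k+1)}_{(k)} \circ \psi^{(k+2)}_{(k+1)} \circ \cdots \circ \psi^{(m)}_{(m-1)} \]

for $k \leq m$, and will sometimes write instead

\[ \cdots \xrightarrow{\psi^{(m+2)}_{(m+1)}} (X_{(m+1)}, \Sigma_{(m+1)}, \mu_{(m+1)}) \xrightarrow{\psi^{(m+1)}_{(m)}} (X_{(m)}, \Sigma_{(m)}, \mu_{(m)}) \xrightarrow{\psi^{(m)}_{(m-1)}} \cdots \]

as a shorthand to denote this overall situation.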

Given an inverse sequence as above, then exactly as in standard ergodic theory
(see, for example, Examples 6.3 of Glasner [14]) we can construct an inverse limit
in the form of a standard Borel space $(X_{(\infty)}, \Sigma_{(\infty)})$, a law $\mu_{(\infty)}$ on
$(X_{(\infty)}^{[k]^\omega}, \Sigma_{(\infty)}^{\otimes [k]^\omega})$ and a family of factors $\psi_{(m)} : X_{(\infty)} \to X_{(m)}$ such that
$\psi_{(k)} = \psi^{(m)}_{(k)} \circ \psi_{(m)}$ for all $k \leq m$ and $((\psi_{(m)})^{[k]^\omega})_\#\mu_{(\infty)} = \mu_{(m)}$ for every $m$.
We will use this construction later in the paper.

Related to the notion of a factor is that of a 'coupling': given laws $\mu$ and $\nu$ on
$(X^{[k]^\omega}, \Sigma^{\otimes [k]^\omega})$ and $(Y^{[k]^\omega}, \Phi^{\otimes [k]^\omega})$ respectively, a coupling of $\mu$ and $\nu$ is a law $\lambda$
on $((X \times Y)^{[k]^\omega}, (\Sigma \otimes \Phi)^{\otimes [k]^\omega})$ whose coordinate projections onto $(X^{[k]^\omega}, \Sigma^{\otimes [k]^\omega})$
and $(Y^{[k]^\omega}, \Phi^{\otimes [k]^\omega})$ are $\mu$ and $\nu$ respectively. This definition generalizes to
couplings of larger collections of laws in the obvious way. We will also have need
for a topology on couplings, set up exactly analogously with the 'joining topology'
of ergodic theory: quite generally, given a countable collection of standard Borel
probability spaces $(X_i, \Sigma_i, \mu_i)_{i \in I}$, the space $C$ of all couplings of the $\mu_i$ on the
product standard Borel space $(\prod_{i \in I} X_i, \bigotimes_{i \in I} \Sigma_i)$ is endowed with the weakest
topology with respect to which all the evaluation maps

\[ \lambda \mapsto \int_{\prod_{i \in I} X_i} \prod_{i \in F} f_i(x_i)\, \lambda(\mathrm{d}x) \]

for collections $f_i \in L^\infty(\mu_i)$ indexed by finite subsets $F \subseteq I$ are continuous.
Just as for joinings of probability-preserving systems (as discussed in Chapter 6 of
Glasner [14]), the restriction here to couplings of fixed one-dimensional marginals
(rather than arbitrary probability measures on the product space) gives that this is
a compact topology on $C$.



4 Strongly stationary laws 

We now introduce the special class of laws that will concern us through most of 
this paper. These are distinguished by satisfying a kind of 'self-similarity' in terms 
of the structure of the index set $[k]^\omega$.

Definition 4.1 (Strong stationarity). A law $\mu$ on $(X^{[k]^\omega}, \Sigma^{\otimes [k]^\omega})$ is strongly
stationary (s.s.) if $\phi^*\mu = \mu$ for every subspace $\phi : [k]^\omega \hookrightarrow [k]^\omega$.

This can be thought of as the analog appropriate to the present setting of the
exchangeability of a family of random variables (or, equivalently, their joint
distribution) under an index-set-permuting action of some countable group: see, for
example, Section 2.2 of [6], where this abstract definition is set up before being
applied to exchangeable arrays (or 'exchangeable random hypergraphs', as they
are formulated there).

Indeed, the only real difference between the settings of that paper and this is that
here our notion of strong stationarity refers to a semigroup of noninvertible
self-maps of the underlying index set, for which it seems difficult to find any
'invertible model'. Furstenberg and Katznelson meet the same difficulty in their original
work, and circumvent it by relying instead only on a weaker symmetry to which
they can associate (using a highly arbitrary selection procedure) a collection of
invertible probability-preserving transformations. By contrast, we will find that this
noninvertibility is of no consequence for our approach below.

Let us next recall Furstenberg and Katznelson's reduction to the case of s.s. laws,
contained in Sections 2.3 and 2.4 of [13].

Lemma 4.2. If Theorem 3.1 holds for all s.s. laws for every $\delta > 0$ then it holds for
all laws for every $\delta > 0$.

Proof. We only sketch the argument, referring the reader to [13] for the details.
The point is that by applying the Carlson-Simpson Theorem [8] to arbitrarily fine
finite coverings of the finite-dimensional spaces of probability distributions on
$\{0,1\}^{[k]^n}$ for increasingly large $n$, we obtain a subspace $\psi : [k]^\omega \hookrightarrow [k]^\omega$ and
an infinite word $w \in [k]^{\mathbb{N}}$ such that the restricted laws
converge to a strongly stationary law as $m \to \infty$, and since all one-dimensional
marginals of the input law gave probability at least $\delta$ to $\{1\}$, the same is true of the
limit. Finally, the condition

\[ \mu\{x \in \{0,1\}^{[k]^\omega} : x_{\ell(i)} = 1\ \ \forall i \leq k\} > 0 \]

for a combinatorial line $\ell$ is also finite-dimensional and open for this topology on the
space of finite-dimensional distributions, so if it holds for the limit measure it must
also hold somewhere for the original measure. □

Definition 4.3. If the law $\mu$ is s.s. then in particular all the one-dimensional
marginals $(\pi_w)_\#\mu$ for $w \in [k]^\omega$ are the same and all the $k$-dimensional marginals

\[ (\pi_{\ell(1)}, \pi_{\ell(2)}, \ldots, \pi_{\ell(k)})_\#\mu \]

for $\ell$ a line in $[k]^\omega$ are the same. We will refer to these as the point-marginal and
line-marginal of $\mu$ and will often denote them by $\mu^\circ$ and $\mu^{\mathrm{line}}$ respectively.



5 Partially insensitive and sated laws 

Definition 5.1 (Partially insensitive $\sigma$-algebras). For any nonempty $e \subseteq [k]$ and
a s.s. law $\mu$ on $(X^{[k]^\omega}, \Sigma^{\otimes [k]^\omega})$ the $e$-insensitive $\sigma$-algebra is the $\sigma$-subalgebra
$\Phi_e \leq \Sigma$ defined by

\[ \Phi_e := \{A \in \Sigma : 1_A(x_{\ell(i)}) = 1_A(x_{\ell(j)})\ \ \forall i, j \in e \text{ for } \mu\text{-a.e. } (x_w)_w \in X^{[k]^\omega}\}. \]

The $e$-insensitive $\sigma$-algebras for different sets $e$ are together referred to as the
partially insensitive $\sigma$-algebras. A measurable function $f$ on $X$ is $e$-insensitive if it
is $\Phi_e$-measurable.

The law $\mu$ is itself $e$-insensitive if $\Phi_e = \Sigma$: that is, if $x_{\ell(i)} = x_{\ell(j)}$ for every $i, j \in e$
for $\mu$-a.e. $(x_w)_w \in X^{[k]^\omega}$.

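We now also construct a larger collection of $\sigma$-algebras from the above, but first
must set up some additional notation. These next $\sigma$-algebras will be indexed by
up-sets in $\binom{[k]}{\geq 2}$: that is, families $\mathcal{I} \subseteq \binom{[k]}{\geq 2}$ such that if $u \in \mathcal{I}$ and $[k] \supseteq v \supseteq u$
then also $v \in \mathcal{I}$. For example, given $e \subseteq [k]$ we write $\langle e \rangle := \{u \in \binom{[k]}{\geq 2} : u \supseteq e\}$
(note the non-standard feature of our notation that $e \in \langle e \rangle$ if and only if $|e| \geq 2$):
up-sets of this form are principal. We will abbreviate $\langle \{i\} \rangle$ to $\langle i \rangle$.

In general, for any up-set $\mathcal{I} \subseteq \binom{[k]}{\geq 2}$ we let $\Phi_{\mathcal{I}} := \bigvee_{e \in \mathcal{I}} \Phi_e$. It is clear from the
above definition that if $e \subseteq e'$ then $\Phi_e \geq \Phi_{e'}$, so we have $\Phi_e = \Phi_{\langle e \rangle}$.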
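For small $k$ the up-sets indexing these $\sigma$-algebras can be listed explicitly. The following Python sketch is purely illustrative (the function names are hypothetical and the enumeration is brute force, so it is sensible only for very small $k$): it builds $\binom{[k]}{\geq 2}$, the principal up-set $\langle e \rangle$, and the list of all up-sets.

```python
from itertools import combinations


def big_subsets(k):
    """All subsets of [k] = {1, ..., k} of size at least 2."""
    letters = range(1, k + 1)
    return [frozenset(c)
            for size in range(2, k + 1)
            for c in combinations(letters, size)]


def principal_up_set(e, k):
    """<e> := the family of all u in binom([k], >= 2) that contain e."""
    e = frozenset(e)
    return frozenset(u for u in big_subsets(k) if u >= e)


def is_up_set(family, k):
    """Check closure under passing to supersets inside binom([k], >= 2)."""
    family = set(family)
    return all(v in family
               for u in family
               for v in big_subsets(k) if v >= u)


def all_up_sets(k):
    """Brute-force list of every up-set, including the empty one."""
    ground = big_subsets(k)
    return [frozenset(fam)
            for size in range(len(ground) + 1)
            for fam in combinations(ground, size)
            if is_up_set(fam, k)]


if __name__ == "__main__":
    print(len(all_up_sets(3)))
    print(sorted(sorted(u) for u in principal_up_set({1}, 3)))
```

The example also shows the non-standard feature noted above: a singleton such as $\{1\}$ does not belong to $\langle 1 \rangle$, whereas $\{1, 2\}$ does belong to $\langle \{1, 2\} \rangle$.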

It is also immediate from the above definition that for any s.s. law $\mu$, $e \in \binom{[k]}{\geq 2}$
and $i, j \in e$ the $\sigma$-subalgebras $\pi_i^{-1}(\Phi_e)$ and $\pi_j^{-1}(\Phi_e)$ of $\Sigma^{\otimes k}$ are equal up to
$\mu^{\mathrm{line}}$-negligible sets, and so we can make the following definition.

Definition 5.2 (Oblique copies). For each $e \subseteq [k]$ we refer to the common
$\mu^{\mathrm{line}}$-completion of the $\sigma$-subalgebra $\pi_i^{-1}(\Phi_e)$, $i \in e$, as the oblique copy of $\Phi_e$, and
denote it by $\Phi_e^\dagger$. More generally we shall refer to $\sigma$-algebras formed by repeatedly
applying $\cap$ and $\vee$ to such oblique copies as oblique $\sigma$-algebras.

Clearly if a law is e-insensitive for some e this amounts to a nontrivial simplifi- 
cation of its structure. In general we will analyze an arbitrary law in terms of its 
possible couplings to insensitive laws through the following definition. 

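Definition 5.3 (Sated laws). For a nonempty up-set $\mathcal{I} \subseteq \binom{[k]}{\geq 2}$ and a s.s. law $\mu$
on $(X^{[k]^\omega}, \Sigma^{\otimes [k]^\omega})$ with partially insensitive $\sigma$-algebras $\Phi_e$, $\mu$ is $\mathcal{I}$-sated if for any
s.s. extension $\tilde{\mu}$ of $\mu$, with partially insensitive $\sigma$-algebras $\tilde{\Phi}_e$, the factor
$\pi : \tilde{X} \to X$ and the $\sigma$-subalgebra $\tilde{\Phi}_{\mathcal{I}}$ are relatively
independent under $\tilde{\mu}^\circ$ over the $\sigma$-subalgebra $\pi^{-1}(\Phi_{\mathcal{I}})$.

The law $\mu$ is fully sated if it is $\mathcal{I}$-sated for every such $\mathcal{I}$.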

Clearly not all laws are sated, but it turns out that we can recover the advantage
of working with a sated law by passing to an extension. The following theorem is
closely analogous to a similar 'satedness' result to appear in @, and is also closely
related to older results on 'pleasant' and 'isotropized' extensions of
probability-preserving systems in [3, 5].

Theorem 5.4 (Sated extension). Every s.s. law has a fully sated s.s. extension.

In light of this it will suffice to prove Theorem 3.1 for fully sated s.s. laws $\mu$. We
will finish this section by proving Theorem 5.4, and then in the next section we
will derive some useful consequences of full satedness for the structure of $\mu$
before using these to complete the reduction of Theorem 3.1 to an infinitary removal
lemma in Section 7.

Lemma 5.5 (Partially sated extension). For any up-set $\mathcal{I} \subseteq \binom{[k]}{\geq 2}$ every s.s. law $\mu$
has an s.s. extension that is $\mathcal{I}$-sated.

Proof. This proceeds by an infinitary 'energy increment' argument: we build a
tower of extensions of $\mu$ each 'closer' to $\mathcal{I}$-satedness than its predecessor and so
that the resulting inverse limit is exactly $\mathcal{I}$-sated.

Let $(f_r)_{r \geq 1}$ be a countable subset of the $L^\infty$-unit ball $\{f \in L^\infty(\mu^\circ) : \|f\|_\infty \leq 1\}$
that is dense in this ball for the $L^2$-norm, and let $(r_i)_{i \geq 1}$ be a member of $\mathbb{N}^{\mathbb{N}}$ in
which every non-negative integer appears infinitely often.
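One concrete choice of such a sequence is the 'triangular' enumeration 1; 1, 2; 1, 2, 3; and so on. The Python sketch below is an illustration only: the name `recurring_indices` is hypothetical, and it enumerates positive integers to match the indexing of $(f_r)_{r \geq 1}$, which is an assumption about the intended convention.

```python
from itertools import count, islice


def recurring_indices():
    """Yield 1; 1, 2; 1, 2, 3; ... so every positive integer recurs forever."""
    for block in count(1):
        for r in range(1, block + 1):
            yield r


if __name__ == "__main__":
    print(list(islice(recurring_indices(), 10)))
```

The same device serves wherever a countable family of tasks must each be revisited infinitely often, as with the sequence of up-sets in the proof of Theorem 5.4.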

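We will now construct an inverse sequence

\[ \cdots \xrightarrow{\psi^{(m+2)}_{(m+1)}} (X_{(m+1)}, \Sigma_{(m+1)}, \mu_{(m+1)}) \xrightarrow{\psi^{(m+1)}_{(m)}} (X_{(m)}, \Sigma_{(m)}, \mu_{(m)}) \xrightarrow{\psi^{(m)}_{(m-1)}} \cdots \]

starting from $(X_{(0)}, \Sigma_{(0)}) = (X, \Sigma)$ and $\mu_{(0)} = \mu$ such that each $(X_{(m+1)}, \Sigma_{(m+1)}, \mu_{(m+1)})$
is obtained by coupling to $(X_{(m)}, \Sigma_{(m)}, \mu_{(m)})$ a new law on some $((X')^{[k]^\omega}, (\Sigma')^{\otimes [k]^\omega})$
such that for this new law we have $\Sigma' = \Phi'_{\mathcal{I}}$.

Suppose that we have already obtained $(X_{(m)}, \Sigma_{(m)}, \mu_{(m)})$ for $0 \leq m \leq m_1$.
We consider two separate cases:

• If there is some further extension

\[ \pi : (\tilde{X}, \tilde{\Sigma}, \tilde{\mu}) \to (X_{(m_1)}, \Sigma_{(m_1)}, \mu_{(m_1)}) \]

such that

\[ \big\| \mathsf{E}_{\tilde{\mu}^\circ}\big(f_{r_{m_1}} \circ \psi^{(m_1)}_{(0)} \circ \pi \,\big|\, \tilde{\Phi}_{\mathcal{I}}\big) \big\|_2 > \big\| \mathsf{E}_{\mu^\circ_{(m_1)}}\big(f_{r_{m_1}} \circ \psi^{(m_1)}_{(0)} \,\big|\, \Phi_{(m_1),\mathcal{I}}\big) \big\|_2 + 2^{-m_1} \]

then choose a particular such extension such that the increase

\[ \big\| \mathsf{E}_{\tilde{\mu}^\circ}\big(f_{r_{m_1}} \circ \psi^{(m_1)}_{(0)} \circ \pi \,\big|\, \tilde{\Phi}_{\mathcal{I}}\big) \big\|_2 - \big\| \mathsf{E}_{\mu^\circ_{(m_1)}}\big(f_{r_{m_1}} \circ \psi^{(m_1)}_{(0)} \,\big|\, \Phi_{(m_1),\mathcal{I}}\big) \big\|_2 \]

is at least half its supremal possible value over such extensions. Now by
restricting to the possibly smaller extension of $(X_{(m_1)}, \Sigma_{(m_1)}, \mu_{(m_1)})$ given
by replacing $(\tilde{X}, \tilde{\Sigma})$ with its factor generated by $\pi$ and the $\sigma$-algebra $\tilde{\Phi}_{\mathcal{I}}$,
we may assume that $\tilde{\mu}$ is itself obtained as a coupling of $\mu_{(m_1)}$ to a law $\mu'$
for which the $\sigma$-algebra $\Phi'_{\mathcal{I}}$ is full, and now we let
$(X_{(m_1+1)}, \Sigma_{(m_1+1)}) := (\tilde{X}, \tilde{\Sigma})$, $\mu_{(m_1+1)} := \tilde{\mu}$ and $\psi^{(m_1+1)}_{(m_1)} := \pi$.

• If, on the other hand, for every further extension $\pi$ as above we have

\[ \big\| \mathsf{E}_{\tilde{\mu}^\circ}\big(f_{r_{m_1}} \circ \psi^{(m_1)}_{(0)} \circ \pi \,\big|\, \tilde{\Phi}_{\mathcal{I}}\big) \big\|_2 \leq \big\| \mathsf{E}_{\mu^\circ_{(m_1)}}\big(f_{r_{m_1}} \circ \psi^{(m_1)}_{(0)} \,\big|\, \Phi_{(m_1),\mathcal{I}}\big) \big\|_2 + 2^{-m_1} \]

then we simply set $\psi^{(m_1+1)}_{(m_1)} := \mathrm{id}_{X_{(m_1)}}$.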
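Finally, let $(X_{(\infty)}, \Sigma_{(\infty)}, \mu^\circ_{(\infty)})$ be the inverse limit probability space of

\[ \cdots \xrightarrow{\psi^{(m+2)}_{(m+1)}} (X_{(m+1)}, \Sigma_{(m+1)}, \mu^\circ_{(m+1)}) \xrightarrow{\psi^{(m+1)}_{(m)}} (X_{(m)}, \Sigma_{(m)}, \mu^\circ_{(m)}) \xrightarrow{\psi^{(m)}_{(m-1)}} \cdots, \]

$\mu_{(\infty)}$ the inverse limit of the measures $\mu_{(m)}$ and $\psi_{(m)} : X_{(\infty)} \to X_{(m)}$ the resulting
factor maps. It is clear from the above construction that the whole $\sigma$-algebra $\Sigma_{(\infty)}$
is generated up to $\mu^\circ_{(\infty)}$-negligible sets by $\Phi_{(\infty),\mathcal{I}}$ and $\psi_{(0)}$, since $\Phi_{(\infty),\mathcal{I}}$ contains
every $\psi_{(m)}^{-1}(\Phi_{(m),\mathcal{I}})$. To show that $\mu_{(\infty)}$ is $\mathcal{I}$-sated, let $\tilde{\mu}$ under $\pi : \tilde{X} \to X_{(\infty)}$ be
any further extension of $\mu_{(\infty)}$, and suppose that $f \in L^\infty(\mu^\circ_{(\infty)})$. We will complete
the proof by showing that

\[ \mathsf{E}_{\tilde{\mu}^\circ}(f \circ \pi \mid \tilde{\Phi}_{\mathcal{I}}) = \mathsf{E}_{\mu^\circ_{(\infty)}}(f \mid \Phi_{(\infty),\mathcal{I}}) \circ \pi. \]

By construction, this $f$ may be approximated arbitrarily well in $L^2(\mu^\circ_{(\infty)})$ by finite
sums of the form $\sum_p g_p \cdot h_p$ with $g_p$ being bounded and $\Phi_{(\infty),\mathcal{I}}$-measurable and
$h_p$ being bounded and $\psi_{(0)}$-measurable, and now by density we may also restrict
to using $h_p$ that are each a scalar multiple of some $f_{r_p} \circ \psi_{(0)}$, so by continuity and
multilinearity it suffices to prove the above equality for one such product
$g \cdot (f_r \circ \psi_{(0)})$. Since $g$ is $\Phi_{(\infty),\mathcal{I}}$-measurable and $\tilde{\Phi}_{\mathcal{I}} \geq \pi^{-1}(\Phi_{(\infty),\mathcal{I}})$, this requirement
now reduces to

\[ \mathsf{E}_{\tilde{\mu}^\circ}(f_r \circ \psi_{(0)} \circ \pi \mid \tilde{\Phi}_{\mathcal{I}}) = \mathsf{E}_{\mu^\circ_{(\infty)}}(f_r \circ \psi_{(0)} \mid \Phi_{(\infty),\mathcal{I}}) \circ \pi, \]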

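and this in turn will follow if we only show that

\[ \big\| \mathsf{E}_{\tilde{\mu}^\circ}(f_r \circ \psi_{(0)} \circ \pi \mid \tilde{\Phi}_{\mathcal{I}}) \big\|_2 = \big\| \mathsf{E}_{\mu^\circ_{(\infty)}}(f_r \circ \psi_{(0)} \mid \Phi_{(\infty),\mathcal{I}}) \big\|_2. \]

Now, by the martingale convergence theorem we have

\[ \big\| \mathsf{E}_{\mu^\circ_{(m)}}(f_r \circ \psi^{(m)}_{(0)} \mid \Phi_{(m),\mathcal{I}}) \big\|_2 \uparrow \big\| \mathsf{E}_{\mu^\circ_{(\infty)}}(f_r \circ \psi_{(0)} \mid \Phi_{(\infty),\mathcal{I}}) \big\|_2 \]

as $m \to \infty$. It follows that if

\[ \big\| \mathsf{E}_{\tilde{\mu}^\circ}(f_r \circ \psi_{(0)} \circ \pi \mid \tilde{\Phi}_{\mathcal{I}}) \big\|_2 > \big\| \mathsf{E}_{\mu^\circ_{(\infty)}}(f_r \circ \psi_{(0)} \mid \Phi_{(\infty),\mathcal{I}}) \big\|_2 \]

then for some sufficiently large $m$ we would have $r_m = r$ (since each integer
appears infinitely often as some $r_i$) but

\[ \big\| \mathsf{E}_{\mu^\circ_{(m+1)}}(f_r \circ \psi^{(m+1)}_{(0)} \mid \Phi_{(m+1),\mathcal{I}}) \big\|_2 - \big\| \mathsf{E}_{\mu^\circ_{(m)}}(f_r \circ \psi^{(m)}_{(0)} \mid \Phi_{(m),\mathcal{I}}) \big\|_2 \leq \big\| \mathsf{E}_{\mu^\circ_{(\infty)}}(f_r \circ \psi_{(0)} \mid \Phi_{(\infty),\mathcal{I}}) \big\|_2 - \big\| \mathsf{E}_{\mu^\circ_{(m)}}(f_r \circ \psi^{(m)}_{(0)} \mid \Phi_{(m),\mathcal{I}}) \big\|_2 \]

and also

\[ \big\| \mathsf{E}_{\tilde{\mu}^\circ}(f_r \circ \psi_{(0)} \circ \pi \mid \tilde{\Phi}_{\mathcal{I}}) \big\|_2 > \big\| \mathsf{E}_{\mu^\circ_{(m)}}(f_r \circ \psi^{(m)}_{(0)} \mid \Phi_{(m),\mathcal{I}}) \big\|_2 + 2^{-m}, \]

so contradicting our choice of $\mu_{(m+1)}$ in the first alternative in our construction
above. This contradiction shows that we must actually have the equality of
$L^2$-norms asserted above, as required. □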

Proof of Theorem 5.4. Pick a sequence of up-sets $(\mathcal{I}_m)_{m \geq 1}$ in which each
possible up-set appears infinitely often. Now by repeatedly implementing the preceding
lemma we can form another tower of extensions above $(X^{[k]^\omega}, \Sigma^{\otimes [k]^\omega}, \mu)$ in which
every $\mu_{(m)}$ is $\mathcal{I}_m$-sated. It is now an immediate check that the resulting inverse
limit $(X_{(\infty)}^{[k]^\omega}, \Sigma_{(\infty)}^{\otimes [k]^\omega}, \mu_{(\infty)})$ is fully sated. □



6 The structure of sated laws 



Having proved the existence of sated extensions, we will now show how the
structure of $\mu$ (and particularly of the partially insensitive $\sigma$-algebras $\Phi_e$) simplifies for
sated systems, before using these results to prove Theorem 3.1 in the next section.

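First we need the following lemma.

Lemma 6.1. If $\mu$ is fully sated then for every $i \in e \in \binom{[k]}{\geq 2}$, if $f \in L^\infty(\mu^\circ)$ is
$e$-insensitive then

\[ \mathsf{E}_{\mu^\circ}\Big(f \,\Big|\, \bigvee_{j \in [k] \setminus e} \Phi_{\{i,j\}}\Big) = \mathsf{E}_{\mu^\circ}\Big(f \,\Big|\, \bigvee_{j \in [k] \setminus e} \Phi_{e \cup \{j\}}\Big). \]

Proof. Clearly

\[ \mathsf{E}_{\mu^\circ}\Big(f \,\Big|\, \bigvee_{j \in [k] \setminus e} \Phi_{e \cup \{j\}}\Big) \]

is always $\big(\bigvee_{j \in [k] \setminus e} \Phi_{\{i,j\}}\big)$-measurable. It will therefore suffice to show that if
$f \in L^\infty(\mu^\circ)$ is $e$-insensitive and orthogonal to the $\sigma$-algebra $\bigvee_{j \in [k] \setminus e} \Phi_{e \cup \{j\}}$ then it is
actually orthogonal to $\bigvee_{j \in [k] \setminus e} \Phi_{\{i,j\}}$. We prove this by contradiction, so suppose
for one such $f$ that we could find some bounded functions $h_j$ for $j \in [k] \setminus e$ such
that each $h_j$ is $\Phi_{\{i,j\}}$-measurable and

\[ \int_X f \cdot \prod_{j \in [k] \setminus e} h_j \, \mathrm{d}\mu^\circ = \kappa \neq 0. \]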

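Re-writing this inner product condition at the level of the whole law $\mu$ it simply
reads that

\[ \int_{X^{[k]^\omega}} f(x_w) \cdot \prod_{j \in [k] \setminus e} h_j(x_w)\, \mu(\mathrm{d}x) = \kappa \]

for any fixed $w \in [k]^\omega$. However, now we apply first the $e$-insensitivity of $f$ to
deduce that also

\[ \int_{X^{[k]^\omega}} f(x_w) \cdot \prod_{j \in [k] \setminus e} h_j(x_{r_{e,i}(w)})\, \mu(\mathrm{d}x) = \int_{X^{[k]^\omega}} f(x_{r_{e,i}(w)}) \cdot \prod_{j \in [k] \setminus e} h_j(x_{r_{e,i}(w)})\, \mu(\mathrm{d}x) = \kappa \]

for every word $w$, and now the $\{i, j\}$-insensitivity of $h_j$ to deduce that

\[ \int_{X^{[k]^\omega}} f(x_w) \cdot \prod_{j \in [k] \setminus e} h_j(x_{r_{e \cup \{j\}, i}(w)})\, \mu(\mathrm{d}x) = \int_{X^{[k]^\omega}} f(x_w) \cdot \prod_{j \in [k] \setminus e} h_j(x_{r_{e,i}(w)})\, \mu(\mathrm{d}x) = \kappa \]

for every word $w$.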



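It follows that if we define the probability measure $\lambda$ on $(X \times X^{[k] \setminus e})^{[k]^\omega}$ to be the
joint law under $\mu$ of

\[ (x_w)_w \mapsto \big(x_w, (x_{r_{e \cup \{j\}, i}(w)})_{j \in [k] \setminus e}\big)_w \]

then all of its coordinate projections onto individual copies of $X$ are still just $\mu^\circ$,
the projection

\[ \pi : \big(y_w, (z_{j,w})_{j \in [k] \setminus e}\big)_w \mapsto (y_w)_w \]

has $\pi_\#\lambda = \mu$ and the projections

\[ \pi_j : \big(y_w, (z_{j',w})_{j' \in [k] \setminus e}\big)_w \mapsto (z_{j,w})_w \]

are $\lambda$-almost surely $(e \cup \{j\})$-insensitive. Therefore through the first coordinate
projection $\pi$ the law $\lambda$ defines an extension of $\mu$, and the above inequality gives a
non-zero inner product for $f$ with some product of $(e \cup \{j\})$-insensitive functions
under $\lambda$ over $j \in [k] \setminus e$ that we can express as

\[ \int (f \circ \pi) \cdot \prod_{j \in [k] \setminus e} (h_j \circ \pi_j)\, \mathrm{d}\lambda = \kappa. \]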

Now $\lambda$ may not be stationary, but at least its marginals onto all individual copies of
$X$ in $(X \times X^{[k] \setminus e})^{[k]^\omega}$ are equal to $\mu^\circ$. It follows that we can re-run the appeal to
the Carlson-Simpson Theorem in Lemma 4.2 to obtain a subspace $\psi : [k]^\omega \hookrightarrow [k]^\omega$
and an infinite word $w \in [k]^{\mathbb{N}}$ such that the pulled-back measures
converge in the coupling topology on $(X \times X^{[k] \setminus e})^{[k]^\omega}$ (recall that for couplings
of fixed marginals this is compact) to a s.s. measure $\tilde{\mu}$. Since $\mu$ was already
strongly stationary, we must still have $\pi_\#\tilde{\mu} = \mu$, and by the definition of the
coupling topology as the weakest for which integration of fixed product functions
is continuous it follows that we must still have, firstly, that

\[ \int (f \circ \pi) \cdot \prod_{j \in [k] \setminus e} (h_j \circ \pi_j)\, \mathrm{d}\tilde{\mu} = \kappa, \]

and secondly that the coordinate projections $\pi_j$ are $(e \cup \{j\})$-insensitive under
$\tilde{\mu}$, since this is equivalent to the assertion that for any $A \in \Sigma$, $i \in e$ and line
$\ell : [k] \hookrightarrow [k]^\omega$ we have

\[ \int_{(X \times X^{[k] \setminus e})^{[k]^\omega}} 1_A(z_{j,\ell(i)}) \cdot 1_{X \setminus A}(z_{j,\ell(j)})\, \tilde{\mu}(\mathrm{d}z) = 0 \]

and this is clearly closed in the coupling topology.

Therefore we have found a s.s. extension $\tilde{\mu}$ of $\mu$ through some factor map $\xi$ under
which

\[ \mathsf{E}_{\tilde{\mu}^\circ}\Big(f \circ \xi \,\Big|\, \bigvee_{j \in [k] \setminus e} \tilde{\Phi}_{e \cup \{j\}}\Big) \neq 0. \]

By satedness, it follows that in fact

\[ \mathsf{E}_{\mu^\circ}\Big(f \,\Big|\, \bigvee_{j \in [k] \setminus e} \Phi_{e \cup \{j\}}\Big) \neq 0, \]

contradicting the condition that $f$ be orthogonal to this $\sigma$-algebra. □

Example. The idea behind the above proof may be made clear by an explication
of the special case $k = 3$, $i = 2$ and $e = \{1, 2\}$. In this case we wish to prove that
if $\mu$ is a fully sated s.s. law on $X^{[3]^\omega}$ and $f \in L^\infty(\mu^\circ)$ is $\{1, 2\}$-insensitive then

\[ \mathsf{E}_{\mu^\circ}(f \mid \Phi_{\{2,3\}}) = \mathsf{E}_{\mu^\circ}(f \mid \Phi_{\{1,2,3\}}), \]

and so we suppose that the right-hand side above is zero and prove that the
left-hand side is also zero. Arguing by contradiction, we suppose otherwise and in this
case let

\[ h := \mathsf{E}_{\mu^\circ}(f \mid \Phi_{\{2,3\}}). \]

As a $\Phi_{\{2,3\}}$-measurable function, this $h$ must be $\{2, 3\}$-insensitive, and so the
condition $h \neq 0$ implies (from the definition of $h$) that

\[ \int_X f h \, \mathrm{d}\mu^\circ = \int_X h^2 \, \mathrm{d}\mu^\circ \neq 0, \]

so we have obtained a nontrivial inner product between the $\{1, 2\}$-insensitive
function $f$ and the $\{2, 3\}$-insensitive function $h$. We wish to deduce from this that $f$
actually has a non-zero inner product with some $\{1, 2, 3\}$-insensitive function. For
a general s.s. law $\mu$ this does not follow, but using in turn the strong stationarity of
$\mu$, the $\{1, 2\}$-insensitivity of $f$ and then the $\{2, 3\}$-insensitivity of $h$ we can write

\[ 0 \neq \int_X f h \, \mathrm{d}\mu^\circ = \int_{X^{[3]^\omega}} f(x_w) h(x_w)\, \mu(\mathrm{d}x) = \int_{X^{[3]^\omega}} f(x_{r_{\{1,2\},2}(w)}) h(x_{r_{\{1,2\},2}(w)})\, \mu(\mathrm{d}x) \]
\[ = \int_{X^{[3]^\omega}} f(x_w) h(x_{r_{\{1,2\},2}(w)})\, \mu(\mathrm{d}x) \]
\[ = \int_{X^{[3]^\omega}} f(x_w) h(x_{r_{3,2}(r_{\{1,2\},2}(w))})\, \mu(\mathrm{d}x) \]
\[ = \int_{X^{[3]^\omega}} f(x_w) h(x_{22 \cdots 2})\, \mu(\mathrm{d}x) \]

for any $w \in [3]^\omega$, where $22 \cdots 2$ has the same word-length as $w$, and this implicitly
defines a non-trivial coupling of $\mu$ to a process that is indexed by $\{2\}^\omega \subseteq [3]^\omega$ and
which can now be re-interpreted simply as a $\{1, 2, 3\}$-insensitive law. Applying the
Carlson-Simpson Theorem to construct from this a similarly nontrivial coupling
that is itself s.s. gives a contradiction with the additional condition that $\mu$ be
$\mathcal{I}$-sated for $\mathcal{I} = \langle \{1, 2, 3\} \rangle$. $\lhd$

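The usefulness of satedness for proving Theorem 3.1 will rest on the following
property.

Theorem 6.2. If $e \subseteq [k]$ is nonempty, $\mu$ is fully sated and $f_i \in L^\infty(\mu^\circ)$ for $i \in e$
then

\[ \int_{X^k} \prod_{i \in e} f_i \circ \pi_i \, \mathrm{d}\mu^{\mathrm{line}} = \int_{X^k} \prod_{i \in e} \mathsf{E}_{\mu^\circ}\Big(f_i \,\Big|\, \bigvee_{j \in e \setminus \{i\}} \Phi_{\{i,j\}}\Big) \circ \pi_i \, \mathrm{d}\mu^{\mathrm{line}}. \]

Proof. We will prove this by contradiction, assuming that the desired equality
fails for some choice of $f_i \in L^\infty(\mu^\circ)$ and constructing from this an extension of
$\mu$ witnessing that it is not sated. For convenience let us temporarily write
$\Xi_i := \bigvee_{j \in e \setminus \{i\}} \Phi_{\{i,j\}}$ (so $\Xi_i = \Phi_{\langle i \rangle}$ when $e = [k]$).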

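Indeed, given such $f_i$ we can write

\[ \int_{X^k} \prod_{i \in e} f_i \circ \pi_i \, \mathrm{d}\mu^{\mathrm{line}} - \int_{X^k} \prod_{i \in e} \mathsf{E}_{\mu^\circ}(f_i \mid \Xi_i) \circ \pi_i \, \mathrm{d}\mu^{\mathrm{line}} \]
\[ = \sum_{j \in e} \int_{X^k} \Big( \prod_{i \in e,\, i < j} f_i \circ \pi_i \Big) \cdot \big(f_j \circ \pi_j - \mathsf{E}_{\mu^\circ}(f_j \mid \Xi_j) \circ \pi_j\big) \cdot \Big( \prod_{i \in e,\, i > j} \mathsf{E}_{\mu^\circ}(f_i \mid \Xi_i) \circ \pi_i \Big) \mathrm{d}\mu^{\mathrm{line}}, \]

and so if this is nonzero then there is some choice of $j \in e$ for which

\[ \int_{X^k} \Big( \prod_{i \in e,\, i < j} f_i \circ \pi_i \Big) \cdot \big(f_j \circ \pi_j - \mathsf{E}_{\mu^\circ}(f_j \mid \Xi_j) \circ \pi_j\big) \cdot \Big( \prod_{i \in e,\, i > j} \mathsf{E}_{\mu^\circ}(f_i \mid \Xi_i) \circ \pi_i \Big) \mathrm{d}\mu^{\mathrm{line}} \neq 0. \]

Now for each $i \in e \setminus \{j\}$ recall that $r_{j,i} : [k]^\omega \to [k]^\omega$ is the letter-replacement
map defined by

\[ r_{j,i}(w)_m := \begin{cases} i & \text{if } w_m = j \\ w_m & \text{else.} \end{cases} \]

In view of the strong stationarity of $\mu$, the above inequality implies that

\[ \int_{X^{[k]^\omega}} \Big( \prod_{i \in e,\, i < j} f_i(x_{r_{j,i}(w)}) \Big) \cdot \big(f_j(x_w) - \mathsf{E}_{\mu^\circ}(f_j \mid \Xi_j)(x_w)\big) \cdot \Big( \prod_{i \in e,\, i > j} \mathsf{E}_{\mu^\circ}(f_i \mid \Xi_i)(x_{r_{j,i}(w)}) \Big) \mu(\mathrm{d}x) \neq 0 \]

for any $w \in [k]^\omega$ such that $w^{-1}\{j\} \neq \emptyset$, since then the points $r_{j,s}(w)$ for
$s = 1, 2, \ldots, k$ form a combinatorial line.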

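It follows that if we define the probability measure $\lambda$ on $(X \times X^{e \setminus \{j\}})^{[k]^\omega}$ to be the
joint law under $\mu$ of

\[ (x_w)_w \mapsto \big(x_w, (x_{r_{j,i}(w)})_{i \in e,\, i < j}, (x_{r_{j,i}(w)})_{i \in e,\, i > j}\big)_w \]

then all of its coordinate projections onto individual copies of $X$ are still just $\mu^\circ$,
the projection

\[ \pi : \big(y_w, (z_{i,w})_{i < j}, (z_{i,w})_{i > j}\big)_w \mapsto (y_w)_w \]

has $\pi_\#\lambda = \mu$ and the projection

\[ \pi_{i_0} : \big(y_w, (z_{i,w})_{i < j}, (z_{i,w})_{i > j}\big)_w \mapsto (z_{i_0,w})_w \]

for $i_0 \in e \setminus \{j\}$ is $\lambda$-almost surely $\{i_0, j\}$-insensitive. Therefore through the first
coordinate projection $\pi$ the law $\lambda$ defines an extension of $\mu$ (not necessarily s.s.),
and the above inequality gives a fixed non-zero inner product for
$f_j - \mathsf{E}_{\mu^\circ}(f_j \mid \Xi_j)$ with some product of $\{i, j\}$-insensitive functions under $\lambda$
over $i \in e \setminus \{j\}$. Arguing exactly as for Lemma 6.1 we obtain the same kind of
correlation with some s.s. extension $\tilde{\mu}$ of $\mu$ through some factor map $\xi$, and so in
light of the above nonvanishing integral we have

\[ \mathsf{E}_{\tilde{\mu}^\circ}\big(f_j \circ \xi - \mathsf{E}_{\mu^\circ}(f_j \mid \Xi_j) \circ \xi \,\big|\, \tilde{\Xi}_j\big) \neq 0. \]

By satedness, it follows that in fact

\[ \mathsf{E}_{\mu^\circ}\big(f_j - \mathsf{E}_{\mu^\circ}(f_j \mid \Xi_j) \,\big|\, \Xi_j\big) \neq 0, \]

manifestly giving the desired contradiction. □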

Remark. Essentially, it is the use of the letter-replacement maps in the above
proof that has been brought to the present paper from the online project [21]. This
idea was brought to my attention during discussions with Terence Tao, a more
active participant in that project. $\lhd$

We can now give our main structural result for sated laws. 

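Theorem 6.3. If $\mu$ is a fully sated law and $\mathcal{I}, \mathcal{I}' \subseteq \binom{[k]}{\geq 2}$ are two up-sets then the
oblique $\sigma$-algebras $\Phi^\dagger_{\mathcal{I}}$ and $\Phi^\dagger_{\mathcal{I}'}$ are relatively independent over $\Phi^\dagger_{\mathcal{I} \cap \mathcal{I}'}$ under $\mu^{\mathrm{line}}$.

Remark. This result together with Theorem 5.4 amount to our analog for s.s.
laws of the representation theorems for partially exchangeable arrays ([18]). $\lhd$

We will deduce Theorem 6.3 by induction using the following special case.

Lemma 6.4. If $\mu$ is a fully sated law, $\mathcal{I} \subseteq \binom{[k]}{\geq 2}$ is an up-set and $e$ is a member
of $\binom{[k]}{\geq 2} \setminus \mathcal{I}$ of maximal size then the oblique $\sigma$-algebras $\Phi^\dagger_e$ and $\Phi^\dagger_{\mathcal{I}}$ are relatively
independent over $\Phi^\dagger_{\mathcal{I} \cap \langle e \rangle}$ under $\mu^{\mathrm{line}}$.

Proof. Suppose that $F_1 \in L^\infty(\mu^{\mathrm{line}}|_{\Phi^\dagger_e})$ and $F_2 \in L^\infty(\mu^{\mathrm{line}}|_{\Phi^\dagger_{\mathcal{I}}})$. It will suffice
to show that

\[ \int_{X^k} F_1 F_2 \, \mathrm{d}\mu^{\mathrm{line}} = \int_{X^k} \mathsf{E}_{\mu^{\mathrm{line}}}\big(F_1 \,\big|\, \Phi^\dagger_{\mathcal{I} \cap \langle e \rangle}\big) F_2 \, \mathrm{d}\mu^{\mathrm{line}}. \]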
Pick z G e and /i G such that -Fi = /i o vTj /u^^'^-almost surely. 

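Let $\{a_1, a_2, \ldots, a_q\}$ be the antichain of minimal elements in $\mathcal{I}$; this clearly generates
$\mathcal{I}$ as an up-set. Since $e \notin \mathcal{I}$ we must have $a_s \setminus e \neq \emptyset$ for each $s \leq q$. Pick
$i_s \in a_s \setminus e$ arbitrarily for each $s \leq q$, so that $\Phi^\dagger_{a_s} = \pi_{i_s}^{-1}(\Phi_{a_s})$ up to $\mu^{\mathrm{line}}$-negligible
sets.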

Now, since $\Phi^\dagger_{\mathcal{I}} = \bigvee_{s \leq q} \Phi^\dagger_{a_s}$, $F_2$ may be approximated arbitrarily well in $L^2(\mu^{\mathrm{line}})$
by sums of products of the form $\sum_p \prod_{s \leq q} \varphi_{s,p} \circ \pi_{i_s}$ with $\varphi_{s,p} \in L^\infty(\mu^\circ | \Phi_{a_s})$, and
so by continuity and linearity it suffices to assume that $F_2$ is an individual such
product term. This represents $F_2$ as a function of the coordinates indexed only
by members of $\{i_1, i_2, \ldots, i_q\} \subseteq [k] \setminus e$, and now we appeal to Theorem 6.2 to
deduce that

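$$\int F_1 \cdot \prod_{s \leq q} \varphi_{s,p} \circ \pi_{i_s} \, d\mu^{\mathrm{line}}
= \int \mathsf{E}_{\mu^\circ}\Big(f_1 \,\Big|\, \bigvee_{j \in [k] \setminus e} \Phi_{\{i,j\}}\Big) \circ \pi_i \cdot \prod_{s \leq q} \varphi_{s,p} \circ \pi_{i_s} \, d\mu^{\mathrm{line}}.$$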

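However, now Lemma 6.1 and the fact that $f_1$ is already $\Phi_e$-measurable imply that

$$\mathsf{E}_{\mu^\circ}\Big(f_1 \,\Big|\, \bigvee_{j \in [k] \setminus e} \Phi_{\{i,j\}}\Big) = \mathsf{E}_{\mu^\circ}\Big(f_1 \,\Big|\, \bigvee_{j \in [k] \setminus e} \Phi_{e \cup \{j\}}\Big),$$

and since each $e \cup \{j\} \in \mathcal{I}$ (by the maximality of $e$ in $\binom{[k]}{\geq 2} \setminus \mathcal{I}$), under $\pi_i$ this
conditional expectation must be identified with $\mathsf{E}_{\mu^{\mathrm{line}}}(F_1 \,|\, \Phi^\dagger_{\mathcal{I} \cap \langle e \rangle})$, as required. □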

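Proof of Theorem 6.3. We fix $\mathcal{I}$ and prove this by induction on $\mathcal{I}'$. If $\mathcal{I}' \subseteq \mathcal{I}$
then the result is clear, so now let $e$ be a minimal member of $\mathcal{I}' \setminus \mathcal{I}$ of maximal
size, and let $\mathcal{I}'' := \mathcal{I}' \setminus \{e\}$. It will suffice to prove that if $F \in L^\infty(\mu^{\mathrm{line}} | \Phi^\dagger_{\mathcal{I}'})$ then

$$\mathsf{E}_{\mu^{\mathrm{line}}}(F \,|\, \Phi^\dagger_{\mathcal{I}}) = \mathsf{E}_{\mu^{\mathrm{line}}}(F \,|\, \Phi^\dagger_{\mathcal{I} \cap \mathcal{I}'}),$$

and furthermore, by approximation, to do so only for $F$ that are of the form $F_1 \cdot F_2$
with $F_1 \in L^\infty(\mu^{\mathrm{line}} | \Phi^\dagger_e)$ and $F_2 \in L^\infty(\mu^{\mathrm{line}} | \Phi^\dagger_{\mathcal{I}''})$. However, for these we can
write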

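$$\mathsf{E}_{\mu^{\mathrm{line}}}(F \,|\, \Phi^\dagger_{\mathcal{I}}) = \mathsf{E}_{\mu^{\mathrm{line}}}\big(\mathsf{E}_{\mu^{\mathrm{line}}}(F_1 \,|\, \Phi^\dagger_{\mathcal{I} \cup \mathcal{I}''}) \cdot F_2 \,\big|\, \Phi^\dagger_{\mathcal{I}}\big),$$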

and by the preceding lemma 

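$$\mathsf{E}_{\mu^{\mathrm{line}}}(F_1 \,|\, \Phi^\dagger_{\mathcal{I} \cup \mathcal{I}''}) = \mathsf{E}_{\mu^{\mathrm{line}}}(F_1 \,|\, \Phi^\dagger_{(\mathcal{I} \cup \mathcal{I}'') \cap \langle e \rangle}).$$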

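On the other hand $(\mathcal{I} \cup \mathcal{I}'') \cap \langle e \rangle \subseteq \mathcal{I}''$ (because $\mathcal{I}''$ contains every subset of $[k]$
that strictly includes $e$, since $\mathcal{I}'$ is an up-set), and so the preceding lemma promises
similarly that

$$\mathsf{E}_{\mu^{\mathrm{line}}}(F_1 \,|\, \Phi^\dagger_{(\mathcal{I} \cup \mathcal{I}'') \cap \langle e \rangle}) = \mathsf{E}_{\mu^{\mathrm{line}}}(F_1 \,|\, \Phi^\dagger_{\mathcal{I}''}).$$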

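Therefore the above expression for $\mathsf{E}_{\mu^{\mathrm{line}}}(F \,|\, \Phi^\dagger_{\mathcal{I}})$ simplifies to

$$\mathsf{E}_{\mu^{\mathrm{line}}}\big(\mathsf{E}_{\mu^{\mathrm{line}}}(F_1 \,|\, \Phi^\dagger_{\mathcal{I}''}) \cdot F_2 \,\big|\, \Phi^\dagger_{\mathcal{I}}\big) = \mathsf{E}_{\mu^{\mathrm{line}}}\big(\mathsf{E}_{\mu^{\mathrm{line}}}(F_1 \cdot F_2 \,|\, \Phi^\dagger_{\mathcal{I}''}) \,\big|\, \Phi^\dagger_{\mathcal{I}}\big)$$

$$= \mathsf{E}_{\mu^{\mathrm{line}}}\big(\mathsf{E}_{\mu^{\mathrm{line}}}(F \,|\, \Phi^\dagger_{\mathcal{I}''}) \,\big|\, \Phi^\dagger_{\mathcal{I}}\big) = \mathsf{E}_{\mu^{\mathrm{line}}}(F \,|\, \Phi^\dagger_{\mathcal{I} \cap \mathcal{I}''}) = \mathsf{E}_{\mu^{\mathrm{line}}}(F \,|\, \Phi^\dagger_{\mathcal{I} \cap \mathcal{I}'}),$$

by the inductive hypothesis applied to $\mathcal{I}''$ and $\mathcal{I}$, as required. □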
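
The last two equalities in this chain compress a short argument. One way to unpack them is sketched below, writing $G$ for the inner conditional expectation (a label introduced here only for convenience) and assuming the displays are read with $\mathcal{I}$, $\mathcal{I}'$, $\mathcal{I}'' = \mathcal{I}' \setminus \{e\}$ and the oblique $\sigma$-algebras $\Phi^\dagger$ as in the statement of Theorem 6.3:

```latex
% Shorthand (not from the text): G := E(F | \Phi^\dagger_{I''}),
% with all conditional expectations taken under \mu^{line}.
\begin{align*}
\mathsf{E}(G \,|\, \Phi^\dagger_{\mathcal{I}})
  &= \mathsf{E}(G \,|\, \Phi^\dagger_{\mathcal{I} \cap \mathcal{I}''})
     && \text{inductive hypothesis for } (\mathcal{I}, \mathcal{I}''),
        \text{ as } G \text{ is } \Phi^\dagger_{\mathcal{I}''}\text{-measurable} \\
  &= \mathsf{E}\big(\mathsf{E}(F \,|\, \Phi^\dagger_{\mathcal{I}''}) \,\big|\, \Phi^\dagger_{\mathcal{I} \cap \mathcal{I}''}\big)
     && \text{definition of } G \\
  &= \mathsf{E}(F \,|\, \Phi^\dagger_{\mathcal{I} \cap \mathcal{I}''})
     && \text{tower property, since }
        \Phi^\dagger_{\mathcal{I} \cap \mathcal{I}''} \subseteq \Phi^\dagger_{\mathcal{I}''} \\
  &= \mathsf{E}(F \,|\, \Phi^\dagger_{\mathcal{I} \cap \mathcal{I}'})
     && \text{because } e \notin \mathcal{I} \text{ gives }
        \mathcal{I} \cap \mathcal{I}'' = \mathcal{I} \cap \mathcal{I}' .
\end{align*}
```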

7 The Density Hales-Jewett Theorem for sated laws

Since it is clear that the assertion of Theorem 3.1 holds for a s.s. law if it holds for
any extension of that law, by Theorem 5.4 it suffices to prove Theorem 3.1 in case
$\mu$ is fully sated.

In this case Theorem 6.2 and Theorem 6.3 together give quite a detailed picture of
the joint distribution of the factors $\Phi^\dagger_e$ under $\mu^{\mathrm{line}}$, and it turns out that this structure
is enough to enable a proof of that theorem along the same lines as for the
multidimensional Szemerédi Theorem in [3]. In particular, Theorem 3.1 now follows from
an 'infinitary removal lemma' essentially identical to that used in [3] (Proposition
6.1 of that paper), which was in turn based on Tao's 'infinitary hypergraph removal
lemma' in [24], with some modifications to fit the context of a proof of multiple
recurrence. The version we will use below is lifted almost verbatim from [3], and is
amenable to an identical proof from Theorem 6.3 as for that result from Corollary
5.2 of [3], so we only state the result here.
Proposition 7.1 (Infinitary removal lemma). Suppose that $\mu^{\mathrm{line}}$ is the line marginal
of a s.s. law $\mu$, and so has the structure described by Theorem 6.3, and that $\mathcal{I}_{i,j}$
for $i = 1, 2, \ldots, d$ and $j = 1, 2, \ldots, k_i$ are collections of up-sets in $\binom{[k]}{\geq 2}$ such that
$[k] \in \mathcal{I}_{i,j} \subseteq \langle i \rangle$ for each $i, j$, and suppose further that the sets $A_{i,j} \in \Phi_{\mathcal{I}_{i,j}}$ are
such that

$$\mu^{\mathrm{line}}\Big(\prod_{i=1}^{d} \Big(\bigcap_{j=1}^{k_i} A_{i,j}\Big)\Big) = 0.$$

Then we must also have

$$\mu^\circ\Big(\bigcap_{i=1}^{d} \bigcap_{j=1}^{k_i} A_{i,j}\Big) = 0.$$

□



This is proved by an induction on a suitable ordering of the possible collections of
up-sets, appealing to a handful of different possible cases at different steps
of the induction, closely related to the induction on edge-size that underlies the
proof of the simplex removal lemma from the finitary hypergraph regularity lemma
(see, for example, Gowers [15] or Nagle, Rödl and Schacht [20]). This inductive
proof is the reason for the above statement in terms of arbitrary collections of
up-sets, but we will need only the special case $k_i = 1$, $\mathcal{I}_{i,1} := \langle i \rangle$ for the proof of
Theorem 3.1.

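Proof of Theorem 3.1 from Proposition 7.1. As remarked above it suffices to
prove Theorem 3.1 for a sated law $\mu$. Given such a law, suppose that $A \in \Sigma$ is such
that $\mu^{\mathrm{line}}(A^k) = 0$. Then by Theorem 6.2 we have

$$\mu^{\mathrm{line}}(A^k) = \int \prod_{i=1}^{k} \mathsf{E}_{\mu^\circ}(1_A \,|\, \Phi_{\langle i \rangle}) \circ \pi_i \, d\mu^{\mathrm{line}} = 0.$$

Now the level set $B_i := \{\mathsf{E}_{\mu^\circ}(1_A \,|\, \Phi_{\langle i \rangle}) > 0\}$ lies in $\Phi_{\langle i \rangle}$, and the above equality
certainly implies that also $\mu^{\mathrm{line}}(B_1 \times B_2 \times \cdots \times B_k) = 0$. Now, on the one hand,
setting $d = k$, $k_i = 1$, $\mathcal{I}_{i,1} := \langle i \rangle$ and $A_{i,1} := B_i$ for each $i \leq k$, Proposition 7.1 tells us
that $\mu^\circ(B_1 \cap B_2 \cap \cdots \cap B_k) = 0$, while on the other we must have $\mu^\circ(A \setminus B_i) = 0$
for each $i$, and so overall $\mu^\circ(A) \leq \mu^\circ(B_1 \cap B_2 \cap \cdots \cap B_k) + \sum_{i \leq k} \mu^\circ(A \setminus B_i) = 0$,
as required. □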
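
The proof uses, without comment, that $A \setminus B_i$ is negligible and that $A$ is covered by the intersection of the $B_i$ together with these differences. A short derivation is sketched below; the shorthand $g_i$ and the complement notation $B_i^{\mathrm{c}}$ are introduced here for convenience and are not notation from the text, and the point marginal is written $\mu^\circ$ on the assumption that this is the measure on the space containing $A$.

```latex
% Shorthand (assumption, not from the text):
%   g_i := E_{\mu^\circ}(1_A | \Phi_{<i>}),  so that  B_i = { g_i > 0 }.
\begin{align*}
\mu^\circ(A \setminus B_i)
  &= \int 1_A \cdot 1_{B_i^{\mathrm{c}}} \, d\mu^\circ \\
  &= \int g_i \cdot 1_{B_i^{\mathrm{c}}} \, d\mu^\circ
     && \text{since } B_i^{\mathrm{c}} \in \Phi_{\langle i \rangle} \\
  &= 0
     && \text{since } 0 \leq g_i \text{ a.s. and } g_i \leq 0 \text{ on } B_i^{\mathrm{c}} .
\end{align*}
% Covering step: a point of A lies either in every B_i or outside some B_i, so
\[
  A \;\subseteq\; (B_1 \cap \cdots \cap B_k) \,\cup\, \bigcup_{i \leq k} (A \setminus B_i),
\]
% and subadditivity of \mu^\circ gives the final inequality of the proof.
```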